arXiv: 1503.08419v3 [math.AP] 13 May 2015 


ON A BOLTZMANN MEAN FIELD MODEL FOR KNOWLEDGE 
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Abstract. In this paper we analyze a Boltzmann type mean field game model for knowl¬ 
edge growth, which was proposed by Lucas and Moll m- We discuss the underlying 
mathematical model, which consists of a coupled system of a Boltzmann type equation for 
the agent density and a Hamilton-Jacobi-Bellman equation for the optimal strategy. We 
study the analytic features of each equation separately and show local in time existence 
and uniqueness for the fully coupled system. Furthermore we focus on the construction 
and existence of special solutions, which relate to exponential growth in time - so called 
balanced growth path solutions. Finally we illustrate the behavior of solutions for the 
full system and the balanced growth path equations with numerical simulations. 


1. Introduction 

Endogenous growth theory is based on the assumption that human capital, innovation 
and knowledge are significant factors for economic growth. Understanding how innova¬ 
tion and knowledge lead to long-term economic growth attracted a lot of interest in the 
macroeconomic literature. Different models have been proposed to describe knowledge in¬ 
crease; most of them relate innovation and/or imitation to knowledge growth. In [101 11; 
economic growth is initiated by imitation in random meetings. Koenig et al. |l_2j pro¬ 
posed a decision based models, in which firms decide between ’imitation’ or ’innovation’. 
Luttnrer pms] claims that noise and imitation lead to growth and considered a model 
in which individuals are characterized by their productivity or knowledge level. Here the 
cumulative distribution function describes the distribution of knowledge, which evolves 
as individuals meet each other, compare ideas and improve their own knowledge level. 
Individual meetings are modeled by ’collisions’ in a Boltzmann type equation for the dis¬ 
tribution function. The interaction frequency among individual agents is assumed to be 
given; agents do not decide how much time they invest in learning or working. Lucas 
and Moll m extended this approach by modeling agents as rational individuals, which 
decide between either option. Each agent’s decision is based on maximizing its individual 
earnings given the distribution of all other agents with respect to their knowledge level. 
The resulting system corresponds to a Boltzmann equation for the agent distribution that 
is coupled to an Hamilton-Jacobi-Bellman equation for the optimal strategy. This novel 
approach combines mathematical models developed in the field of kinetic equations as well 
as mean field games. 

Mathematical tools and methods from statistical mechanics such as kinetic theory have 
become a popular and successful tool in economics and social sciences. The kinetic theory 
developed by Boltzmann studies the statistical behavior of a system not in equilibrium 
and has its origins in analyzing the thermodynamics of dilute gases. The Boltzmann 
equation describes the evolution of the probability distribution function of molecules due 
to microscopic interactions, namely the collisions between particles. In socio-economic 
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applications these collisions correspond to trading events cf. EE, the exchange of opinion 
cf- OS '2 HZ, lHl f2l IT] or non-cooperative games, cf. [6] . 

In mean field models the effect of all other individuals on a single individual is replaced 
by an averaged effect. In mean field game theory the dynamics of a single individual are 
determined from a stochastic optimal control problems, in which the optimal strategy is 
influenced by the knowledge of the distribution of all other players at all times. Mean 
field game models received an increasing interest in economics, e.g. for describing the 
strategic decision making processes in large player stochastic games, cf. mm in the 
last years. Their general structure corresponds to a coupled system of a Fokker-Planck 
equation describing the evolution of the macroscopic agent density (forward in time) and 
a Hamilton-Jacobi-Bellman equation for the optimal strategy (backward in time). 
Degond et al. [6j introduced a different mean-field kinetic model for rational agents that 
act in a game-theoretical framework. This framework initially developed to describe herd¬ 
ing, has been extended to model wealth evolution cf. [5J and further investigated with 
respect to model predictive control, cf. [1J. 

In this paper we investigate existence and uniqueness of solutions to a Boltzmann mean 
field game (BFMG) model for knowledge growth, with a particular focus on the construc¬ 
tion of so called balanced growth path. The BMFG system exhibits interesting analytic 
features, such as mass accumulation of agents at the maximum initial knowledge level. 
Balanced growth path are special solutions, for which the cost function, here the overall 
production, grows exponentially in time. We discuss and identify specific assumptions and 
conditions, which allow for the existence of such solutions and illustrate the behavior with 
numerical simulations. 

This paper is organized as follows: in section [2] we present the Boltzmann mean field game 
model of Lucas and Moll and discuss specific modeling assumptions. Section [3] focuses 
on the separate analysis of the Boltzmann and the Hamilton-Jacobi-Bellman equation as 
well as the coupled system. In section [4] we discuss the existence of balanced growth path 
solutions. We conclude by presenting various numerical examples to illustrate the behavior 
of the proposed model in section [5j 

2. A Boltzmann type model for knowledge growth 

Lucas et al. m studied the following scenario: consider an economy or society with a 
constant number of interacting agents, which are characterized by their knowledge level. 
Agents can decide how much time they devote to working (by producing goods with the 
knowledge the already have) and how much to learning (by exchanging ideas with oth¬ 
ers). Each agent is characterized by its knowledge level z € X and the fraction of time 
s = s(z,t ) :Ix [0,T] -> [0,1] it devotes to learning. The interval X may correspond to 
the positive real line, i.e. X = M + , or the bounded interval with maximum knowledge level 
z, i.e. X = [0,z]. The function / = f(z,t ) describes the distribution of the agents with 
respect to their knowledge z and time t. We assume that each agent has one unit of time, 
hence the time devoted to working corresponds to 1 — s(z,t). 

The evolution of the distribution / = f(z,t ) is modeled by a Boltzmann type approach; 
individuals meet (i.e. collide) and exchange ideas and knowledge. Lucas and Moll proposed 
the following minimal interaction law to model knowledge increase. If two individuals with 
knowledge levels z and z' meet, their post-collision knowledge corresponds to 

2 = ma x(z, z‘'), 


(1) 
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i.e. the agent with the lower knowledge level matches its level with the other. Based on 
CD, the evolution of all agents / = f(z,t ) can be described by the following Boltzmann 
type equation: 

9tf(z,t) =-a(s(z,t))f(z,t) [ f(y, t)dy + f(z, t) ( a(s(y,t))f{y,t)dy (2a) 

Jz Jo 

f(z,0) = f 0 (z). (2b) 

The function a = a(s(z,t)) : [0,1] —> [0,1] denotes the learning function, i.e. the interac¬ 
tion probability of an individual with knowledge level z. We assume that the initial distri¬ 
bution of all agents /o = fo(z) is a probability distribution, i.e. it satisfies J x fo(z) dz = 1 
and fo(z) > 0 for all z € X. Note that equation 0 can be written in terms of the Heaviside 
function H = H(z), namely 


9tf(z, t ) = -a(s(z, t))f(z, i)((l - H)*f) + f(z, t)(H * (a/)). 


( 3 ) 


Another reformulation of @ is based on the cumulative distribution function (CDF) 
F(z, t ) = Jq f(x, t ) dx and reads as 


d t F(z, t) = -[1 - F(z, t)] f a(s(x, t))f(x, t)dx. 

Jo 


( 4 ) 


We assume that the working and learning time is directly related to the individual benefit; 
more precisely that the earnings y = y(z, t ) of an agent with knowledge level 2 correspond 
to the product of the time the individual spends on working, i.e. 1 — s(z,t), times its 
knowledge level 2 . Hence we have 

y(z,t) = (1 - s{z,t))z. (5) 

The per-capita production illustrates the total earning in a society and is given by 


Y (*) = Jt ~ *) dz - 


( 6 ) 


Based on the per-capita production each agent wants to maximize its earnings (discounted 
by a given temporal discount factor r € K + ), by choosing an optimal partition of its 
working respectively learning time. Then the optimal ratio of working to learning time (a 
quantity related to the work-life balance), is determined by the solution s = s(z,t) of the 
optimal control problem 

V(x,lf) = max! f [ )(i _ s (z, t))zp x (z, t)dzdt ], 

*6<S l Jt' Jo 


subject to 


dtp x (z,t) = —a(s)p x (z,t) / f(y,t)dy +f(z,t) a(s)p x (y,t ) 

Jz Jo 


dy 


with p x (z,t ') = 5 X . Here S denotes the set of admissible controls given by 


5 = {s:Ix[0,T]-> [0,1]}. 


( 7 ) 
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Then the optimal strategy can be calculated via the Lagrange functional 

rT rz 


c = 


If 


0 -r{t-t') ( I _ 


(1 - s(z,t))zp x (z,t ) 


d t p x (z,t) + a(s(z,t))p x (z,t) / f(y,t)dy 

J Z 

f(z,t ) f a(s(y,t))p x {y,t)dy 
Jo 


dzdt. 

The optimality condition with respect to / corresponds to the Hamilton-Jacobi-Bellman 
equation: 

dtV(z, t) — rV(z, t ) + max [(1 — s(z, t))z — a(s(z, t))V(z, t)(( 1 — H) * /) 

s£S 

+a(s(z,tMl-H)*(Vf))] = 0. 

The function V = V(z,t ) denotes the value function, and represents the production level 
starting from knowledge level z at time t and controlling the system until a final or infinite 
time. Altogether we obtain a Boltzmann mean field game (BMFG) of the form 

dtf{z,t ) = -a(S(z,t))f(z,t)((l -H)*f) + f (z, t) (^H * (a{S(z,t) 
d t V(z,t) - rV(z,t ) = - [(1 - S(z, t))z - a(S(z,t))V(z,t)((l - H) * f) 

+ a(S(z, t))((l — H) * (V /))], 


(8a) 

(8b) 


S(z, t ) = arg max [(1 — s(z, t))z — a(s(z, t))V(z, i)((l — H) * /) 
se5 


(8c) 


+ a(s(z,tMl-H)*(Vf))}, 
f(z,0) = fo(z), (8d) 

V(z,T) = 0. (8e) 

Lucas and Moll refer to a = a(s ) : [0,1] —> as the learning technology function, which 

may have the form 

a(s) = aos n , n G [0,1). (9) 

We conclude this section by stating three special modeling situations, which we shall 
discuss and analyze later. 


2.1. Symmetric meetings. In the initial model only one party in the interaction/collisions 
gains knowledge, the other one has no benefit. To capture symmetric meetings Lucas and 
Moll propose a modified Boltzmann type equation of the form 


/ OO 

[a(s(z, t)) + (3a(s(y, t))]f(y, t)dy 

+ f{z,t) f [a(s(y, t)) + /3a(s(z, t))\f(y, t)dy, 
Jo 


( 10 ) 


where (3 € [0,1] encodes the probability to learn from each other. The case /3 = 0 
corresponds to the original model, /3 = 1 to perfectly symmetric meetings. 


2.2. Special case: q = ao £ R + . Let us consider the BMFG model (|8]) with a given 
constant learning function a = cco € M, i.e. n = 0 in ©. Note that in this case the value 
function V = V(z,t ) is positive by definition and that / = f(z,t) is non-negative for all 
times if f(z, 0) > 0. Then the maximum of 

((1 - s(z,t))z + ao (V(z,m 1 -H)*f) + ((1 - H) * (Vf))) 
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is given by s(z, t ) = 0. In this case system © decouples and the Boltzmann type equation 
(| 8 aj) can be written in terms of the cumulative distribution function F = F(z,t), i.e. 

d t F(z, t ) = —a 0 (l - F(z, t))F{z , t). (11) 

Hence the optimal strategy V = V(z,t ) can be calculated independently, which also 
motivates the separate analysis of the Boltzmann and the HJB equation in the next section. 


2.3. Balanced growth path. Lucas and Moll postulated the existence of balanced growth 
path (BGP) solutions to system ([ 8 |), for which the production rate ([ 6 |) grows exponentially 
in time. BGPs correspond to solutions in the rescaled variables (<j>,v,cr): 

f(z,t ) = e~' yt 4>(ze^ lt ), V(z,t) = e yt v(ze~ lt ) and s(z,t ) = <r( 2 :e _7< ) (12) 

assuming the existence of a constant 7 € M + . If this is the case the production d 6 j) can be 
transformed to 

poo 

Y(t) = e 7< / [1 — cr(x)]x(f)(x)dx. 

Jo 

The rescaled equations for (cj>,v,a) = (J)(x),v(x),a(x)) read as 


px poo 

- l4>(x) - 7 4>'{x)x = <j>(x) / a(a(y))4>(y) dy - a(a(x))(j>(x) / 4>(y) dy , 

JO Jx 

(■r — j)v(x) + 7 u , (x)x = max < (1 — a)x + a(cr) / [v(y) — v(x)\J>{y) dy 


(13a) 

(13b) 


where S = {a : M + —>• [ 0 , 1 ]} denotes the set of admissible controls. 


3. Analysis of the Boltzmann mean field model 

In this section we present a local existence result for the fully coupled Boltzmann mean 
field model Q. We start with the analysis of the Boltzmann equation d2|) for a given 
interaction rate a = a(z,t). Then we discuss existence and uniqueness of solutions for the 
Hamilton Jacobi equation and conclude the section by studying the fully coupled system. 

In this section we will need the following assumptions on the initial datum /q = fo(z): 

(Al) Let /o € L°°(Z) be a probability density, i.e. J x fo(z)dz = 1 and fo(z) > 0 for all 
z£l 

3.1. Analysis of the Boltzmann equation for a given learning function a. We 

start with the analysis of the Boltzmann type equation (J2]) for a given learning function 
a = a(z,t). Hence we consider 

dtf(z, t ) = -a(z, t)f(z , t) J f(y, t ) dy + f(z, t ) 
f{z,0) = f 0 (z), (14b) 

on the interval X = [0, z]. 

Remark 3.1. Let us introduce the operators G s = a(s(z,t)) fj f(y,t) dy and 
L . = Jo a(s(y,t))f(y,t) dy. Then equation (fTD) can he written as 

dtf(z,t) = f(z,t ) (. L s f(z,t ) - G s f(z,t )). 


/ 

Jo 


u(y,t)f(y,t)dy, 


(14a) 
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Note that G s and L s are adjoint operators, i.e. L* = G s since 

rz pz 

( L s f,g)= [ a(s(y,t))f(y,t)dy\g(z,t)dz 

Jo Jo 

= [ [f 9 (z)dz\a(s{y,t))f(y,t)dy. 

JO Jy 

First we present a global existence result for (1141) in time. 

Theorem 3.2. Let \(A 1 j| he satisfied and a = a(z,t) £E L l (T) x L°°([0,T]). Then equation 
(fTTl) has a global in time solution f = f(z,t ) £ L 1 (T) x L°°([0, T}). 

Proof. Let T > 0 be given, a := max a and A the following closed subset: 

A = {ge C([0, T],L 1 (Z)),g > 0, ||<?M)||i < a}, 

where a = cf fo{y) dy. For each g £ A we define the operator 4>(g) as the solution of 

dtf(z,t) = ~a(z,t)f(z,t)[ g(y,t) dy + f(z,t) f a(y,t)g(y,t)dy, (15) 

J z Jo 

with initial data f(z, 0) = fo(z). Then the existence of a solution follows from Picard 
Lindeloef by showing that the operator 

(a) maps A onto itself, 

(b) is a contraction for T small. 

A priori estimates: The change of the total mass can be estimated by 

^ J f(y, t) dy < a\\g\\ i j f(y , t) dy. 

Based on this estimate we use Gronwall to deduce that 

/ f{y, t) dy < exp(d f Iblli^s) f fo(y)dy. 

Ji Jo Jx 

Hence (a) is satisfied. To show that <F is a contraction we consider equation (1151) for two 
given functions g\ and g 2 , gt £ A, i = 1,2. Then their difference satisfies 

d t (fi - f 2 ){z,t) = - a(z,t)fi(z,t) [ gi(y,t)dy + h{z,t) f a(y,t)g 1 (y,t) dy 

Jz JO 

rz pz 

+ a(z,t)f 2 (z,t) g 2 (y,t)dy - f 2 (z,t) a(y,t)g 2 {y,t) dy. 

Jz Jo 

Using H/ 2 ||i < ||/o||ie aaT =: Ki oc , for the id-norm of the difference, we obtain the differ¬ 
ential inequality 

dt\\fi ~ / 2 II 1 < 2aa||/i — f 2 \\i + 2Ki oc a\\gi — g 2 \\i- 

Since fi(z, 0) = f 2 (z, 0) we deduce, using Gronwall, that d> is a contraction for sufficiently 
small time T. Note that all constants in the local existence argument depend on the initial 
mass only. Since we have mass conservation, i.e. 

J f(z , t) dz = j fo(z) dz for all t > 0, 

we can iterate the local existence argument at T, 2T,... to obtain global existence. □ 


Next we show that the support of / remains bounded if the initial datum /q has a compact 
support. 
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Proposition 3.1. Let a £ C([0,T) xl) and f = f(z,t ) be a continuous solution to (ThO) . 
i.e. f £ C([0,T) x I) with suppf(-, 0) C [0 ,M], M < z. Then 

supp(f(-,t)) C [0, M] for all times t > 0. 

Proof. The proof is based on the maximum principle. Let us assume there exists a point 
z in (M,z\ such that f(z,t ) > 0. Then 0 < dtf(z,t). But since f(z, 0) = 0 for all z > M, 
we deduce that f(z,t ) = 0 for all z > M. j? □ 

Note that Lemma 13.11 is only valid for continuous solutions. However we expect that 
solutions of ([lip converge to Dirac deltas as time evolves. This can be explained by the 
fact that individuals with a lower knowledge level gain knowledge in each collision, but 
individuals with the greater knowledge level cannot improve. Hence we conjecture the 
formation of Dirac deltas at z = M, if supp(/) C [0, M]. This can be observed in the 
evolution of the first order moment, i.e. 

It Jx Z ^ z ^ dz = Jj- z ^ z,t ^ Jo a ( y,t }f( y } dy ~ f a ( z ^)f(v^) zd y] dz 

= I f(z) J a(y,t)f(y,t){y-^dydz, (16) 

>o 

where we used the fact that L s and G s are adjoint operators (see Remark 13.ID . Hence we 
deduce that the first order moment is increasing in time. Also the mass located in the 
interval (zo,z) is increasing, because 

7 /»2 /»2 PZ PZ 

4 / f( z i t)dz = / [ f (z, t)f (y, t)a(y, t)dy — / f(z,t)a(z, 

U'*' Jzq J zo J 0 Jz 

P Z P Z P z 

= [ f(z,t)f(y,t)a(y,t)dy- f(z,t)f(y, 

J zo JO J zo 

zo 

f(z,t)f(y,t)a(y,t)dydz > 0. 

From the previous estimates we deduce the following theorem. 

Theorem 3.3. Let a(z,t) > a > 0 and z £ supp(f), then 

/(■) t) ^*6*. 

Proof. Setting zq = 0 in equation (fT71) gives ^ Jf f(z,t)dz = 0 and therefore implies mass 
conservation. From (fI7j) we deduce that 

-^ F i z o,t) = ^(1 - F(zo,t)) > a j f(z,t)dzF(zo,t ) = a(l - F(z 0 ,t))F(z 0 ,t). 

This differential inequality implies that the CDF F(zo,t) —> 0 for zq < z as time t —> oo. 
Since we can choose zq close to z we conclude that / converges to a Dirac mass. □ 

Remark 3.4. Note that the formation of a Delta Dirac mass accumulates at z = z, where 
z = max, supp(f) for compactly supported initial datum and z = z for positive initial data 
on the bounded domain T = [0, z]. If fo(z) > 0 for all z £ M + the mass accumulates at 



t)f(y,t)dy _| dz 
t)a(y, t)dy] dz 

(17) 


z = oo. 
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3.2. Analysis of the Hamilton-Jacobi Bellman equation. Next we study the an¬ 
alytic behavior of the Hamilton-Jacobi-Bellman equation for a given / £ (7(0, T, L 1 ) on 


J = M+: 


d t V(z,t ) - rV(z, t) 


max [(1 — s(z, t))z — a(s(z, t))V(z, t)(( 1 — H) * f) 
seS 


(18a) 


V(z,T) = 0. (18b) 

We shall need the following assumption for the terminal value function in the rest of the 
section: 

(A2) Let the final data V(-,T) in equation (|18li be non-negative and non-decreasing. 

To ensure the existence of a maximizer we need the following assumptions on the learning 
function a = a(s): 

(A3) Let a : [0,1] —> M + , a £ (7 oo ([0,1]), a(0) = 0, c/(0) = oo, a" < 0 and a monotone. 
In the following we shall use the variable 

B = -V(z,t)( 1 - H) * f + (1 - H) * (Vf), 

to enhance readability. 

Lemma 3.5. Let assumption be satisfied, z > 0 and B £ R. Then there exists a 
unique solution S = S(B) of the optimization problem 

max ((1 — s)z + a(s)B), (19) 

s£S 

with S = arg max s es((l - s)z + a(s)B). 

Proof. Let ft := a -1 , then the problem with Q = a(s) is equivalent to 

arg max z(l — Bit)) + Bcr 
Ce[o,a(i)] 

Because of the strict concavity of —/3, there exists a unique solution. □ 

Lemma 3.6. Let assumption \ (A 7)| be satisfied, z > 0, B £ M and S = S(B ) be the optimal 
control satisfying (fl9l) for a given B. If 

lim a”(S(B))B 3 < 0, 

£-U) 

then the maps B —> S(B), B —> a(S(B)) and B —> a(S(B))B are Lipschitz. 


Proof. We distinguish between the following three cases: 
Case 1: If B < 0, then s = 0. 


Case 2: If B > 


a'(l) 


and since a is concave we deduce that 


z{ 1 — s) + Ba{s ) < z(l — s) + -Ba(l) + Ba'(l)(s — 1) 

= Ba{ 1) + (1 — s)(z — 5^(1)) < Ba( 1). 


Hence the maximum is attained at s = 1. 
Case 3: If 0 < B < 


TIJ 


let a'(s) = fna(s) and a"(s) = -j^a(s). In this case there exists 


a unique solution of ^ct(s) = which gives the maximum. Furthermore we have that 


and 


T o{s ) 5 '( B ) = ~ ^ ,'(B) = -g^-y 


= s “ (s)5 ' (b) = a ' <s)s ' (B) = f s ' m = 


a 


./3 


za" 


( 20 ) 
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Because lim^^o — B 3 a"(B) > 0, we conclude that S is piecewise continuously differen¬ 
tiable. Since 


lim —B z a(B) = oo, 

o 

we deduce that S'(B) —> 0 as B —» 0. Hence we have continuity at B = 0 and B = 
Lipschitz continuity follows for S(B). The same is true for a(S)B. □ 


Lemma 3.7. Let assumption (A2) be satisfied. Then the value function V(-,t) solving 
(fTHl) is non-negative and non-decreasing for all times t £ [0, T). 


Proof. Equation (118)) can be written as 

d t V(z,t) - rV(z,t) = -(1 - S(z, t))z 


roo poo 

/ V(y,t)f(y,t)dy-V(z,t) / f{y,t)dy 

J Z J Z 


( 21 ) 


POO POO 

V(zo,t) / f(y,t)dy-V(z 0 ,t ) / f(y,t)dy 

j ZO J zo 


— a(S(z, t)) 

In a minimum point zq we have 
d t V(z 0 ,t)-rV(zo,t) < -(l-S(z 0 ,t))z 0 -a 

J ZO •> zo 

Since this inequality is backwards, it preserves non-negativity. 

Calculating the derivative of equation (121)) with respect to z, we obtain 

d t V z (z,t) - rV z (z,t ) = S z (z, t)z + S(z,t) - 1 

POO 

-a'(S(z,t))S z (z,t)B + a(S(z,t))V z (z,t) / f(y,t)dy. 

J Z 

Using of = z/B , it follows that 

/ OO 

f(y,t)dy. 

Since S < 1 we deduce that V z stays non-negative. □ 


Lemma 3.8. Let assumption IfASjj be satisfied. Then B(-,t) is non-increasing and the 
maximizer S(-,t) is non-increasing for all times t £ [0,T). Moreover the function S(-,t ) 
is strictly decreasing on the interval where 0 < S(-,t) < 1, except in the degenerate case 
of -6(0, t) = 0. Then S(z,t) = 1 for small z. 


Proof. We have already seen above that B(z,t ) > 0 and 

POO 

B z = —V z (z, t) / f{y,t)dy< 0. 

J Z 

If B(0,t ) = 0, then S(z,t ) = 1 for all z. Otherwise B(0,t) > 0, so B > and 

therefore the maximizer S(z,t ) is equal to 1 for small z. If 0 < B < , we have 

S(z,t) = (c/) -1 (-§)• Since a is strictly concave, S(z,t ) is strictly decreasing. □ 

The previous results lead to the following existence and uniqueness theorem for the 
Hamilton-Jacobi-Bellman equation. 

Theorem 3.9. Let f £ (7(0, T, L l ) be given and a = a(s,t ) satisfy assumption ^(A3 )| 
Then there exists a unique solution V £ (7(0, T, L°°) of (fT8l) with V(z,T ) = 0. Moreover, 
let V be a solution of (fTHT) with f, then there exist constants m and D (independent ofV 
and f) such that 

||U-U|| 00 <De mt ||/-/|| c(0)TiL i ) ||U|| 00 


(22) 
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Proof. The proof is based on the following statements: 

(i) V -+ B(V, /) = (1 — H) -k (Vf) — V(1 — H) * / is Lipschitz on the spaces given. 

(ii) B(V) — > S and B(V) -+ a(S)B are Lipschitz because of Lemma (13.61) . 

Then equation (I18D is of the form 

d t v = R(V), 

with R Lipschitz and we can conclude the proof with Picard-Lindeloef. For the difference 
V — V we obtain 

dt(V - V) = R{V) - R{V) = R(V) - R{V) + R(V) - R(V). 

Since the estimate || B(V,f) - B(V,f )||oo < ||V^||oo||/ - f\\c(o, t,l+ holds, we can show 
that 

\\R(V) - R(V)\\ < K\\f - f\\\\V\\oo 

with K independent of V, f, f. Therefore we conclude 

||F-^||oo<^e mi ||/-/|| c( 0 jTiil) ||y|| oo . 

□ 


3.3. Analysis of the fully coupled Boltzmann mean field game system. We show 
existence and uniqueness of the fully coupled system using a fixed-point argument. 
Because of the term (1 — s)z, we expect linear growth of V in z. Therefore we are looking 
for a solution in the space (7(0, T. Lf^_ z (W + )), where 


IT 00 := L^° +Z (R + ) := {u = (1 + z)w\w € L °°{K 
To compensate we use a weighted L 1 -norm for /: 


M .. ,, \ U { Z )\ 

)} with Huy£,?° := ess sup --. 

1+z 1 + z 


W 1 := L(j_(M + ) := {u =—— —\w G L 1 ^" 1 ")} with ||u|| L i := f \u{z)\{\ + z) dz. 

!+* 1 + Z I 

First we state two lemmas, which provide the necessary estimates and bounds for the local 
existence proof. 


Lemma 3.10. Let the initial datum fo = fo( z ) satisfy assumption (Al) and the learning 
function a = a(s ) assumption \ (A 3)\ Then every solution f = f(z,t) to © has a bounded 
first order moment. 

Proof. We reiterate from (1161) that 

^ L Z ^ Z ' ^ dz ~ Jj J a<yVi V ’ ^ y dydz - “ j ^ y dy - 

This gives an exponential bound for the first order moment. 

Lemma 3.11. Let V = V(z,t ) be a solution to (18b|) . Then V is bounded in Lff 
Proof. Dividing equation (l8bl) by 1 + z and changing t to —t, we obtain 


and 


□ 


'1+z 


V V z 1 

d t-r— < r T— + T— + cnB(V, f) 

1 + z 1 + z 1 + z 1 + z 

/ oo 

(Vf)(x)dx < ||D||w°o||/||iyi < C\\V\\ w ~e Kt . 


This implies 


d t - -< r 


1 + z 


-a 


1+z 1+z 1+z 




>Ce 


Kt 


□ 
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Theorem 3.12. Let assumptions (Al) (A3) be satisfied, 
fully coupled Boltzmann mean field game system (JH } on 1 = 
solution. 


If linis^o ~~ 77 — < oo, then the 
M + has a unique local in time 


Proof. For local existence, we take a g £ C(0, T, L\_ (M + )) and solve 

l + Z 

d t V(z,t ) - rV(z, t) = — max [(1 - s(z,t))z - a(s(z,t))B(V,g)\ , (23) 

seS 

for V and S. Then we solve 

dtf{z,t ) = -a(S(z,t))f(z,t)(( 1 -H)*f) + f(z,t)^H * (a(S(z,t))f)^, (24) 

for /. Let \F denote the operator mapping g to /. We show that \F is a contraction by 
considering two solutions g and g. Taking the difference of equation 112311 for g and g gives 

d t (V -V) = r(V — V) + z(S - 5) - (a(S)B(V, g) - a(S)B(V, g)). 

To show that the second term on the RHS is Lipschitz, we calculate S'(B ) = — „ = 

— . Similarly for the third term on the RHS we deduce that 

— ( aB) = a' (S)S' (B)B + a = zS' + a = + a. 

dB a" 

Since both derivatives are bounded we obtain 


d t (V - V) < r(V - V) + K\(B(V,g) - B(V,g)\. 


Let us consider the last term only. Due to the structure of B, i.e. B(V,g ) = (1 — H ) * 
{Vg) — V(\ — H) -k g , we work on the part (1 — H) * (Vg) only. The other part can be 
estimated using similar arguments. We deduce that 


(1 - H) * (Vg) - (1 ~H)k (Vg) = (1-H)* [(V - V)g + V(g - g)) 

/*oo /*oo 

= / (V-V)gdx+ / V(g-g)dx 

J z J z 


(25) 


< ||R — R||iyoo ll^lltyi + HRlIw 00 IIS' ~ ^llw 1 - 


This implies the following inequality: 

d t (V -V)< r(V - V) + 2K\\V - V\\ w ~ Mw i + \\V\\w°° h ~ §\\w i- 

So for J Q r j |g — g\\wi dt small, ]|F — V"||vy°° is small. Moreover according to Lemma (13.81) . 
5 = 5 = 1 for small z, so the Lipschitz constant coming from (12011 is bounded. Therefore 
we obtain that also the term | a(S) — a(5)| is small. This implies for equation (12311 that 
11/ — f'Ww 1 — a \\9 ~ d\\w 1 with a < 1 for T small enough. Hence the operator ’F is a 
contraction, which concludes the proof. □ 


4. Balanced growth paths 

In this section we discuss the existence of balanced growth path solutions and the con¬ 
vergence behavior towards them. Balanced growth paths correspond to solutions (fi, v, a) 
for which the production function ([ 6 ]) grows exponentially in time. We reiterate that 
the rescaled equations for the Boltzmann mean field game (JH]) in the variables (4>, v, a) = 
(fi(x),v(x), a(x)) with 

f(z,t ) = e~' yt 4>(ze~' yt ), V(z,t) = e' yt v(ze~ lt ) and s(z,t ) = cr(ze~ lt ) 


(26) 
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read as 


px poo 

fi'(x)x = fi(x) a(a(y))<t>(y) dy - a{<j{x))<j>(x) l (j>(y) dy (27a 
Jo Jx 




(r — ^)v{x) + 7 ^> , (x)x = max < (1 — a)x + a(a) / [v(y) — v{x)](j){y) dy 


(27b) 


where H = {a : M + —> [0,1]} denotes the set of admissible controls. A necessary pre¬ 
requisite for the existence of BGP solutions is the assumption that the initial cumulative 
distribution function F(z, 0) has a Pareto tail, which is given by: 

(A4) The productivity function F(z, 0) = fg fo(y)dy has a Pareto tail, i.e. there exist 
constants k,8 G M + such that 


Condition 


lim 1 F(Z,0) = k. 

Z —^OO Z 1 / (7 

in the rescaled variable reads as: 
f°° <f>(y) dy 

lim Jz \ : = k. 


(28) 


(29) 


z—^oo i/e 

Lemma 4.1. Let assumption be satisfied. Then F = F(z,t ) has a Pareto tail with 
the same decay rate 6 for all times t € [0,T]. 

Proof. Note that we can rewrite equation (QJ as 

dt [1 — F(z, t)} = [1 — F(z, t)]G(z, t) with G(z, t) := f a(s(x,t))f(x,t)dx. 

Jo 

Then the solution is given by 

1 — F(z, t) = [1 — F(z, 0)] exp( / G(z, s ) ds ). 

Jo 

Multiplication with yields 

[1 — F(z, t)] 2 1//e = [1 — F(z, O )]^ 1 / 0 exp( f G(z, s ) ds). 

Jo 

For a fixed time s the function G(z, s ) is monotonically increasing in z and bounded. 
Hence we can pass to the limit z —> oo and deduce that the function F(z,t ) also has a 
Pareto tail for all time t € [0, T\. □ 

Lemma 4.2. Let assumption (A4) \ be satisfied. Then the growth parameter 7 € R is given 
by 


POO 

7 = 0/ a(a(y))cj>(y)dy. 
Jo 


(30) 


Proof. Recall that F = ff f(y,t)dy satisfies ([3]). Using that F(z,t ) = <h(ze 7t ) we deduce 
that 


PX 

7 < F , (x)x = [1 — 4>(x)] / a(cr)(f>(y) dy. 

Jo 


(31) 


Dividing this equation by x 9 , passing to the limit x —> 00 and using assumption (A4) 
we obtain 


1 

-Igh = -k J a(a(y))<t>{y)dy. 


and subsequently formula (13(1 . 


□ 
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4.1. Existence. Next we study the existence of BGP. In the special case of a = ao we 
can show existence of solutions, for the fully coupled problem we shall provide first results 
on the existence of ( 7 ,fi) for a given (v,a). 


4.2. Special case: a = 07 . We reiterate that system 11271) decouples in the case of a 
constant learning function a = cto £ M + . Hence we solve equation (I27al) for (j) = 4>(z) 
first. Since a is constant, we have a = 0 and we can solve the equation (127b |) for v = v(z), 
giving us a global solution. 

Theorem 4.3. Let assumption \(A4)\ be satisfied and a = ao- Then there exists a unique 
BGP solution ( < h,'u,0) and a scaling constant 7 to (1771) given by 


7 = a o 0 [ f Q (z ) dz, <4>0) = / 

Ji 1 + kx 


4>(y)dy, 


kx~ x ! Q 

Proof. Note that the initial datum fo uniquely determines the scaling constant 7 for the 
BGP in the case of constant learning function a = ao. Since 

f fo(z) dz = I f(z)dz= f e _ 7 V(ze~ 7 *) dz = 

Jx Jx J 

we deduce for 7 that 

7 = Oa 0 J <j>(y) dy = 9ao j f (z, t) dz. 

For a = ao we can write equation (I27ap as 

— r y&(x)x = — ao[l — ( h(x)]<h(x). 


(32) 


We define := i.e. := 1 — i and obtain 


x& = a(4> — 1) 


with a = This equation has the solution 


4> = bx a + 1 


and 


$ = 


bx a 


bx a + 1 

Since we look for solutions which have a Pareto tail we can determine the constant b from 
the Pareto tail assumption |(A4)| and obtain 

<F =--—- 777 . (33) 

1 + kx l / e 

In the case a = «cb the maximum on the RHS of (I27b|) is attained at a = 0. In order to 
solve equation ()27bp with <f> given and <7 = 0, we consider w := vcj> which satisfies 


/»oo 

rw + 'yw'x = <px + aofi / w dy — aow 

J X 


4>dy. 


(34) 


We define W := ff° w dy and rewrite equation (1341) as 

rw + 7 w'x = Q'x + ao&W — ao&w. 


This gives: 


With H'(x) 


(7 - r)W’ - ~i{W'x)’ = &X + ao 
x<h'(x), it follows that 

(7 - r)W - 7 W'x = H + ao$W + K. 


(35) 


Equation (1351) is a first order ODE with W ( 00 ) = 0, which can be solved by considering the 
solution to the homogeneous equation and then using the variation of constants method. 
Hence we obtain a unique solution w, and therefore v = w/(f> . □ 
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For constant a we are able to prove the following results about the stability of BGPs. 

Theorem 4.4. Let assumption be satisfied, a = ao and f denote a solution to 

the original BMFG problem ([8|). Then the rescaled density ip = e yt f(xe' lt , t) converges 
pointwise to the BGP solution given by (l26l) . as t —> oo. 

Proof. The function ip(x, t) solves 

px p oo 

d t ip(x, t) - 'yip(x, t) - jd x i’(x, t)x = ip(x, t) / a 0 ip(x,t) dy - a 0 fi{x,t) / ip(x,t)dy 

Jo Jx 

and therefore its primitive ^(x,t) := ip(y,t) dy satishes 

dp&(x,t) — 7 'F(x,t) — 7 xd x ^>(x,t) = — ao[l — ^(x, t)]\H(x, t). (36) 

We define U{x~ 1 / e ,t) := x l / e \ (-^ — l) i.e Hi = 1 +fc( 1 i 1 / 9;f and deduce 
dt* = -V 2 kx~ 1/e d t U, 

d x m = m 2 kx~ l / e ^ [x- 1 / 6 -^ + x-W-'dyU) , 

\P(1 - \J) = m 2 kx~ l/d U. 

Hence we can rewrite equation (1361) as 

-d t U-^U-^yd y U = -a 0 U. 

Since ip has a Pareto-tail, U solves the equation 

dtU + ^ydyU = 0 with Uq{x) = U(x, 0) = 1. 

0 

This equation has the solution U(y,t ) = Uo(e~ yt y), hence U(y,t ) converges pointwise to 
1 for t —>• oo. □ 


4.3. Existence for the general model. We conclude the section by proving existence 
of a solution ( 7 , <fi), which has a Pareto tail for a given (a, v ). Existence of the fully coupled 
system is a challenging problem. 

Theorem 4.5. Let assumption \ (A djj hold and a £ C 1 ([0,oo)) denote a given function, 
which satisfies 

a(z) = 1 for z £ [0, zq], a'(z) < 0. 

Then there exists a 7 £ M + and a solution f> £ L 1 ([0, 00 )) to equation (|27aD . which has a 
Pareto tail. 


Proof. The function a is non-increasing and equal to 1 in the interval [0, xo], hence 
a(a(z)) = a( 1) on [0, xo]. We know from the previous subsection that in the case a = a 0 
there exists a solution <b = <f>(x), x £ [ 0 , xq ] of the form 


$(*) 


bx a 

bx a + 1 ’ 


(37) 


where a = a(l )/7 on the interval [0,xo]. That way we obtain <f>(xo). 
We rewrite m by integration by parts as 


7 $ ; (x)x = [1 — 4>(x)] 


<h(x)a(cj(x)) — f $(y)-^a{a{y))dy 

Jo dy 


(38) 
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and can obtain a solution on [xo,oo) starting from <I>(xo) using Picard-Lindelof. Next we 
check that the Pareto-tail condition is satisfied. To do so we change variables to x := x~ l ^ e 
and define <I>(x) =: 1 — K(x)x. The existence of the limit 


lim ■ 

x —^oo 


<P(XJ 


X 


.- 1/0 


is equivalent to the existence of lim^o I\ (x ). In this new variable equation 111 1 li reads as 


K'x=-k[ a(a)(K£)'d£, 

Jo 

where a is a transformed in the new variable x. Since (Kx)' is non-negative we can 
estimate the integral on the right hand side 


K'x > -K 


Jo 


d £ = —KaKx. 


Then it follows that 


— 7 t < a and therefore K(0) <- —-. 

K z 1 — axoA(xo) 


Since d>(x) — > 1 for x — > oo, we have xqK(xq) — > 0 for xq —>• 0. This means that the limit 
lim^oc exists. 

The proof follows by iterating between d> and 7 . For 9 < 1, the derivative $>' is bounded. 
So we obtain a fixed point ( 7 , cj>) satisfying equations (I27al) and (1301) . □ 


5. Numerical simulations 


In this section we present an iterative scheme to solve © numerically. First we illustrate 
the formation of blow-up solutions for the Boltzmann type model (I 8 al) . Then we compare 
the numerical solution of our scheme with the BGP calculated using the code provided 
by Lucas and Moll. For additional information on the construction of solutions and the 
numerical solver for the BGP we refer to [Hj. Finally we study the stability of BGP 
solutions with respect to initial perturbations. 


5.1. Numerical scheme. We consider a bounded domain Z = [ 0 . z]. where z denotes to 
the maximum knowledge level. The spatial discretization corresponds to N logarithmically 
spaced intervals. The temporal domain is split into equidistant time steps of size At. Let 
Zi denote the i — th grid point and tk = kAt the k-th time step. We set the initial agent 
distribution to f(x, 0 ) = fo(x) to 


k —kry 

x) = -x e e Kx . 

0 


fo(x) = 


Note that in this case the cumulative distribution function of Fq(z) = f n fo{y)dy has a 
Pareto tail, i.e. assumption |(A4)| is satisfied. Let V(z,T ) = 0 be the terminal condition of 
the optimal strategy in the iterative solver. Then the numerical results of the full model 
are based on the following iterative procedure: 

(1) Calculate the solution / = f(z, t ) of the Boltzmann type equation (ISal) by updating 
the solution via an explicit in time discretization: 


f(zi,t k+ 1 ) = f(zi,t k ) + At 


x(s(zi,t k ))f(zi,t k ) / f(y,tk)dy+ 
J Zi 

rzi 

f(zi,t k ) / a(s(y,t k ))f(y,tk)dy 
Jo 
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(a) P = 0 (b) p = 1 

Figure 1. Distribution of agents / = f(z,t) at different times starting at 
t = 0 up to t = 5. 


where the integrals on the right hand side are evaluated using the trapezoidal rule. 

(2) Solve the Hamilton Jacobi Bellman equation (I8bl) . by first calculating the max¬ 
imum via the optimality conditions. Then determine the new optimal velocity 
using the updated density distribution / = f(z,t ) and the function s. 

In particular 

max ( (1 - s)z + a(s) f {V(y, t) - V(z , t))f(y, t)dy 
s(z)es V J z 

is given by 

(a) 2 = 0: Then s = 1 since a = a(s) is a strictly monotone function. 

(b) 2 = 2: Then the integral is equal to zero, hence s = 0 gives the maximum. 

(c) 2 6 (0, z): Then s = min ({ aQ z n g )" -1 , l ), hence we cut-off s if it lies outside 

the interval [0,1]. 

(3) Go to (1) until convergence. 


5.2. Numerical simulations of the Boltzmann type equation for a given a. In 

our first example we study mass accumulation in the case of compactly supported initial 
data f(z, 0), cf. Lemma 13.31 We assume that the maximum knowledge level is denoted 
by 2 = 1 , and choose an initial agent distribution of 

. [2 for all 2 < 0.5 

f(z, 0) = < 

|0 otherwise. 

We set a = l — 2, i.e. individuals with the lowest knowledge devote all their time to ’in¬ 
teractions’, those with the maximum knowledge do not interact at all. Figure Q] illustrates 
the formation of a Dirac at 2 = 0.5 for symmetric and non-symmetric meetings. Note that 
the lines correspond to / at different discrete time steps. Furthermore we observe that the 
formation of the Dirac happens much faster in case of symmetric meetings, i.e. /3 = 1. 
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- Time dependent simulation 

- - Balanced growth path 
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(a) Transient vs. BGP (b) Linear growth 

Figure 2. Evolution of the production function Y = Y(t) in time for 
different choices of n and 6 


5.3. Numerical simulations of the full Boltzmann mean field game model. Next 
we compare our results with the numerical simulations of the balanced growth path solu¬ 
tions by Lucas and Moll. We choose the same simulation parameters in Q and (12811 : 

cto = 0.0849, n = 0.3, 9 = 0.5, k = 0.05 and r = 0.06. 


To compare the results of the two numerical solvers we fix a sufficiently large final time T, 
i.e. t € [ 0 , 200 ] and rescale the computational domain I using the parameter 7 determined 
from the BGP simulations of Lucas and Moll. The simulation results for k = 400 time 
steps and N = 1001 discretization points are depicted in Figure 2(a) Note that we can 


not expect the existence of BGP solutions in general. If we choose for example 
cto = 0.0849, n = 0.9, 6 = 0.1, k = 0.05 and r = 0.06, 


i.e. a larger value of n and a smaller value of 9 then the BGP solver of Lucas and Moll 
is not converging. The simulation results in Figure 2(a) reveal the reason why this is the 
case. Here the production function Y = Y(t) is growing linear in time, hence the ansatz 
proposed by Lucas and Moll is not satisfied. 


5.4. Stability of balanced growth path. In our final example we illustrate the stability 
of balanced growth path with respect to small perturbations of the initial data. We 
consider a perturbation of the initial distribution of the form 


foi z ) = fo( z ) +0.1(1 -z)sin(257rz)x[ 0 .i,i], 


which corresponds to a perturbation on the interval [ 0 . 1 , 1 ] that does not change the overall 
mass, see Figure 3(a) Note that the initial datum still has a Pareto tail. We set ao, n, k, 


9 and r to the same values as in the previous example and solve the system on the time 
interval t £ [0,250] using 500 equidistant time steps. Figure 0 compares the evolution of 
balanced growth path for the corresponding unperturbed initial datum, with the transient 
simulation. We observe that this perturbation does not change the long time behavior 
of the production rate. Note that the difference of the two solutions at t = 250 can be 
explained by the terminal condition V(z,t = 250) = 0 for the transient simulation. 
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Time dependent simulation 
- Balanced growth path 


(b) Evolution of the production rate Y 

Figure 3. Stability of BGP solutions 


6. Conclusion 

In this paper we present first analytic results as well as numerical simulations for a novel 
Boltzmann mean field game model for knowledge growth proposed by Lucas and Moll Ca¬ 
in this model the distribution of individuals with respect to the knowledge level evolves 
according to a Boltzmann equation. Collisions correspond to knowledge exchange, the in¬ 
dividual interaction rate is determined by maximizing the individual productivity (given 
the common knowledge of the distribution of all other agents). This gives to a coupled 
system of a Boltzmann equation and a Hamilton-Jacobi-Bellman equation. Knowledge 
growth is an inherent nature of the model, which is also reflected in the analytic results. 
The value function of the HJB equation is growing linearly in z, hence we can only provide 
local in time existence. Balanced growth path solutions, which also correspond to expo¬ 
nential growth of the production function in time, illustrate this nature as well - although 
we are not able to provide existence results of these special solutions in a general situation, 
we provide first results on their existence and stability in the case of a special interaction 
function. 

This summary gives indications about several open analytic problems which shall be ad¬ 
dressed in the near future, e.g. existence and stability of BGP solutions for the fully 
coupled system. Another point of interest corresponds to the generalization of the model. 
For example by considering a common noise via an additional diffusion term, more general 
interaction laws for knowledge growth or knowledge decay caused by forgetting informa¬ 
tion. 
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